Dialect classification using acoustic and linguistic features in Arabic speech

نویسندگان

چکیده

<span lang="EN-US">Speech dialects refer to linguistic and pronunciation variations in the speech of same language. Automatic dialect classification requires considerable acoustic differences between different categories speech. This paper proposes a model composed combination classifiers for Arabic by utilizing both features spontaneous The comprises an ensemble focusing on frequency ranges within short-term spectral features, as well classifier ‘i-vector’, whilst use extracted transformer models pre-trained large text datasets. It has been shown that proposed fusion multiple achieves accuracy 82.44% identification task five dialects. represents highest reported dataset, despite relative simplicity model, its applicability relevance tasks. </span>

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dialect-based speaker classification of Chinese using acoustic features invariant with extra-linguistic factors

Chinese dialects-based speaker classification using modern speech technologies is really a challenge, not only because the situation of Chinese dialects is very complicated, but also because acoustic features of utterances convey dialectal information together with extra-linguistic information such as age, gender, speaker, etc. In this paper, we propose a new speaker classification technique us...

متن کامل

Emotion classification in children's speech using fusion of acoustic and linguistic features

This paper describes a system to detect angry vs. non-angry utterances of children who are engaged in dialog with an Aibo robot dog. The system was submitted to the Interspeech2009 Emotion Challenge evaluation. The speech data consist of short utterances of the children’s speech, and the proposed system is designed to detect anger in each given chunk. Frame-based cepstral features, prosodic and...

متن کامل

Measuring Norwegian dialect distances using acoustic features

Computational dialectometry has been proven to be useful for finding dialect relationships and identifying dialect areas. The first to develop a method of measuring dialect distances was Jean Séguy, assisted and inspired by Henri Guiter (Chambers and Trudgill, 1998). Strongly related to the methodology of Séguy is the work of Goebl, although the basis of Goebl’s work was developed mainly in dep...

متن کامل

Arabic Dialect Identification in Speech Transcripts

In this paper we describe a system developed to identify a set of four regional Arabic dialects (Egyptian, Gulf, Levantine, North African) and Modern Standard Arabic (MSA) in a transcribed speech corpus. We competed under the team name MAZA in the Arabic Dialect Identification sub-task of the 2016 Discriminating between Similar Languages (DSL) shared task. Our system achieved an F1-score of 0.5...

متن کامل

Automatic Dialect Detection in Arabic Broadcast Speech

In this paper, we investigate different approaches for dialect identification in Arabic broadcast speech. These methods are based on phonetic and lexical features obtained from a speech recognition system, and bottleneck features using the i-vector framework. We studied both generative and discriminative classifiers, and we combined these features using a multi-class Support Vector Machine (SVM...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IAES International Journal of Artificial Intelligence

سال: 2023

ISSN: ['2089-4872', '2252-8938']

DOI: https://doi.org/10.11591/ijai.v12.i2.pp739-746